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Abstract C 4 photosynthesis represents a most remarkable case of convergent evolution of a 
complex trait, which includes the reprogramming of the expression patterns of thousands of genes. 
Anatomical, physiological, and phylogenetic and analyses as well as computational modeling 
indicate that the establishment of a photorespiratory carbon pump (termed C 2 photosynthesis) is a 
prerequisite for the evolution of C 4 . However, a mechanistic model explaining the tight connection 
between the evolution of C 4 and C 2 photosynthesis is currently lacking. Here we address this 
question through comparative transcriptomic and biochemical analyses of closely related C 3 , C 3 -C 4 , 
and C 4 species, combined with Flux Balance Analysis constrained through a mechanistic model of 
carbon fixation. We show that C 2 photosynthesis creates a misbalance in nitrogen metabolism 
between bundle sheath and mesophyll cells. Rebalancing nitrogen metabolism requires anaplerotic 
reactions that resemble at least parts of a basic C 4 cycle. Our findings thus show how C 2 
photosynthesis represents a pre-adaptation for the C 4 system, where the evolution of the C 2 system 
establishes important C 4 components as a side effect. 
DOI: 10.7554/el_ife.02478.001 



Introduction 

The dual-specific enzyme ribulose 1 ,5-bisphosphate carboxylase/oxygenase (Rubisco) catalyzes two 
opposing reactions — the carboxylation and the oxygenation of ribulose 1,5-bisphosphate. The former 
reaction yields 3-phosphoglycerate (3-PGA), whereas the latter produces 2-phosphoglycolate (2-PG). 
3-PGA is reduced to carbohydrates in the Calvin-Benson cycle and incorporated into biomass. 
However, 2-PG is toxic, which requires its removal by a metabolic repair pathway called photorespira- 
tion (Anderson, 1971; Bowes et al., 1971; Ogren, 1984; Leegood et al., 1995). In the photorespira- 
tory cycle, 2-PG is regenerated to 3-PGA, but it involves the release of formerly assimilated C0 2 and 
NH 3 , entails energy costs for the plants and reduces the efficiency of photosynthesis by up to 30% 
(Ehleringer et al., 1991; Bauwe et al., 2010; Raines, 2011; Fernie et al., 2013). Eight core enzymes are 
required for photorespiration, which in higher plants are located in the chloroplast, the peroxisome, 
and the mitochondrion {Bauwe et al., 2010; Figure 1A). The pathway rescues 3 A of the carbon, which 
would otherwise be lost through the oxygenase activity of Rubisco (Peterhansel et al., 2010; Fernie 
et al., 2013). Ammonia refixation in the chloroplast by the combined activities of glutamine synthase 
(GS) and glutamine oxoglutarate aminotransferase (GOGAT) is an integral part of photorespiration. 

In hot and dry environments and under low atmospheric C0 2 conditions, when the oxygenation 
activity of Rubisco is increased, the high rate of photorespiration becomes unfavorable for the plants 



Mallmann et al. eLife 2014;3:e02478. DOI: 10.7554/el_ife.02478 



1 of 23 



Research article 



Genomics and evolutionary biology | Plant biology 



eLife digest Environmental pressures sometimes cause different organisms to independently 
evolve the same traits. A dramatic example of this phenomenon, which is called convergent 
evolution, can be seen in the modes used by plants to convert carbon dioxide from the air into 
starch during photosynthesis. 

Early plants existed in an environment with high levels of carbon dioxide in the air. Over time, 
carbon dioxide levels decreased, so plants evolved more efficient types of photosynthesis to cope. 
A very efficient type of photosynthesis, called C 4 photosynthesis essentially represents a carbon 
dioxide concentration mechanism. It has evolved at least 62 times independently in 19 different 
families of flowering plants. 

Scientists have shown that a less advanced, low-efficiency version of photosynthetic carbon 
dioxide concentration, called C 2 photosynthesis, is a stepping-stone to C 4 photosynthesis. It is 
also known that the evolution of C 4 photosynthesis required changes to the expression patterns 
of thousands of genes, but the exact mechanism that leads from C 2 photosynthesis to C 4 
photosynthesis is not clear. 

To explore this in greater detail, Mallmann, Heckmann et al. studied plants from the genus 
Flaveria, which belongs to the same family as sunflowers and asters. Under identical greenhouse 
conditions, plants that use three different photosynthetic pathways — C 3 photosynthesis, 
C 4 photosynthesis, or an intermediate between the two — were grown and their gene expression 
patterns were compared. Computer simulations were used to model the metabolism of plants that 
relied on C 2 photosynthesis. 

Based on the modeling, it appears that C 2 photosynthesis shifts the balance of nitrogen 
metabolism between two types of cell that are critical to photosynthesis. To rebalance the nitrogen, 
several genes are expressed to trigger an ammonia recycling mechanism. The same genes are 
turned on during C 4 photosynthesis, and this recycling mechanism include parts of the C 4 process. 

The findings of Mallmann, Heckmann et al. suggest that the initial steps in C 4 photosynthesis 
evolved to prevent nitrogen imbalance. Over time, this mechanism was co-opted to become part of 
a more efficient form of photosynthesis, which may explain why so many different plants evolved 
from C 2 to C 4 photosynthesis. 
DOI: 10.7554/el_ife.02478.002 



(Sage, 2001, 2013). C 4 plants possess a mechanism that minimizes the oxygenase function of Rubisco 
and thereby reduces photorespiration and decreases the loss of carbon. C 4 photosynthesis is based on 
a division of labor between two different cell types, mesophyll and bundle sheath cells, which are 
organized in a wreath-like structure called 'Kranz Anatomy' (Haberlandt, 1904] Dengler and Nelson, 
1999). Atmospheric C0 2 is initially fixed in the mesophyll by phosphoenolpyruvate carboxylase (PEPC), 
and the resulting four-carbon compound is transported to the bundle sheath cells and decarboxylated 
by NADP/NAD malic enzyme or phosphoenolpyruvate carboxykinase {Hatch et al., 1975). Thereby 
C0 2 is concentrated at the site of the Rubisco in the bundle sheath cells {Hatch, 1987), outcompeting 
the molecular oxygen. As a consequence, photorespiration is drastically reduced as compared to 
C 3 plants, and C 4 plants are characterized by a high photosynthetic efficiency (Figure 1B). 

C 4 plants have evolved multiple times independently from C 3 ancestors. The evolution of C 4 photo- 
synthesis occurred at least 62 times in 19 different families of the angiosperms (Sage et al., 2011), 
implying a low evolutionary barrier towards expression of this trait. The analysis of recent intermediate 
species (Bauwe and Kolukisaoglu, 2003; Sage, 2004; Bauwe, 2011; Sage et al., 2012, 2013; 
Schulze et al., 2013) indicates that establishing a photorespiratory C0 2 pump was an early and impor- 
tant step in the evolution towards C 4 photosynthesis (Figure 1C). Since the two-carbon compound 
glycine serves as a transport metabolite, this photorespiratory C0 2 concentrating mechanism is also 
termed C 2 photosynthesis. Computational modeling of the evolutionary trajectory from C 3 to C 4 pho- 
tosynthesis indicated C 2 photosynthesis represented an evolutionary intermediate state (Heckmann 
et al., 2013; Williams et al., 2013) as well suggesting that C 2 photosynthesis is a prerequisite for the 
evolution of C 4 . However, it remained unclear if the evolution of C 2 photosynthesis fosters the evolu- 
tion of C 4 photosynthesis beyond providing a selection pressure to reallocate Rubisco to the bundle 
sheath. 
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Figure 1 . The genus Flaveria as a model organism to study C 4 evolution. Schematic view of the photorespiratory pathway (A), the NADP-ME type C 4 
pathway as it can be found in C 4 Flaveria species (B) and the C 2 photosynthesis pathway (C). (D) Phylogeny and physiological properties of selected 
Flaveria species. The phylogeny was redrawn according to McKown et al. (2005), C0 2 compensation points are taken from Ku et al. (1991), incorpora- 
tion of 14 C0 2 is from Moore et al. (1987) and the ratios of GLDP B (expressed in all chlorenchyma cells) and GLDP A (expressed in bundle sheath cells 
only) are from Schulze et al. (2013). (Abbreviations: AGT: serine glyoxylate aminotransferase; AlaAT: alanine aminotransferase; AspAT: aspartate 
aminotransferase; GDC: glycine decarboxylase complex; GGT: glutamate, glyoxylate-aminotransferase; GLYK: D-glycerate 3-kinase; GOX: glycolate 
oxidase; HPR: hydroxypyruvate reductase; MDH: malate dehydrogenase; NADP-ME: NADP dependent malic enzyme; PEPC: phosphoenolpyruvate 
carboxylase; PGLP: 2-phosphoglycerate phosphatase; PPDK pyruvate, phosphate-dikinase; RUBISCO: Ribulose-1 ,5-bisphosphat-carboxylase/-oxygenase; 
SHM: serine hydroxymethyltransferase; 2-OG: oxoglutarate; 2-PG 2-phosphoglycolate; 3-PGA: 3-phosphoglycerate; Gin: glutamine; Glu: glutamate; 
OAA: oxaloacetate; PEP: phosphoenolpyruvate; TP: triosephosphate). 
DOI: 10.7554/eLife.02478.003 



In the present study, we have used the genus Flaveria as a model system for investigating the tran- 
sition from C 2 to C 4 photosynthesis. To this end, we study a phylogenetic framework consisting of C 3/ 
C3-C4 intermediate, and C 4 species {Powell, 1978, Edwards and Ku, 1987, Ku et al., 1991) of this 
genus which rather recently evolved C 4 (Christin et al., 2011), focusing on genes encoding photores- 
piratory enzymes and other components of C 2 photosynthesis. The genus Flaveria contains three main 
phylogenetic groups, of which the first diverging group includes all C 3 Flaveria. Clade B contains seven 
C3-C4 intermediate species and the C 4 -like species F brownii. All C 4 Flaveria species belong to clade 
A, which also contains several C 4 -like species and the C 3 -C 4 intermediate F ramosissima (McKown 
et al., 2005; Figure 1D). We hypothesized that the analysis of species in the genus Flaveria combined 
with in silico modeling elucidates the evolutionary changes accompanying and following the estab- 
lishment of the C 2 pathway. To this end we simulated the metabolism of C 2 plants by coupling a mech- 
anistic model of C 3 -C 4 intermediate photosynthesis (von Caemmerer, 2000; Heckmann et al., 2013) 
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with a detailed modified stoichiometric model of C 4 photosynthesis (Dal'Molin et aL, 2010), and 
investigated the evolution of C 4 photosynthesis and photorespiration by following the changes in 
mRNA and protein abundance along the evolutionary path. 

RNA and protein amounts of the majority of the photorespiratory enzymes were reduced in C 4 as 
compared to C 3 species. In contrast, photorespiratory mRNA and protein amounts did not decrease in 
the C3-C4 intermediate species but were mostly equal or even higher than in the C 3 species, demon- 
strating that the establishment of the photorespiratory C0 2 pump in the genus Flaveha relies on coor- 
dinated changes in the expression of all core photorespiratory enzymes. Metabolic modeling in 
combination with comparisons of transcript abundances in the different Flaveha species strongly indi- 
cates that introduction of C 2 photosynthesis has a direct impact on the nitrogen metabolism of the 
leaf. Its implementation necessitates the parallel establishment of components of the C 4 cycle to cope 
with these changes in refixation of photorespiratory nitrogen. Based on these results, we predict a 
mechanistic interaction between C 4 and C 2 photosynthesis. 

Results 

Selection of Flaveria species, cultivation of plant material and 
experimental design 

To study the evolution of the expression of photorespiratory and C 4 cycle genes during the transition 
from C 3 to C 4 photosynthesis in the genus Flaveha, nine species reflecting the evolutionary trajectory 
taken were selected, including two C 3 (F. robusta and F pringlei), two C 4 {F bidentis and F thnervia), 
and five C 3 -C 4 intermediate species {Figure 1D). According to their C0 2 compensation points and the 
percentage of carbon initially fixed into malate and aspartate, F chloraefolia and F pubescens were 
earlier classified as type I C 3 -C 4 intermediates. F anomala and F ramosissima belong to the type II 
C 3 -C 4 intermediates and F brownii is classified as a C 4 -like species (Edwards and Ku, 1987; Moore et 
al., 1987; Cheng etaL, 1988; Ku etal., 1991). Type I C 3 -C 4 intermediates are defined as solely relying 
on the photorespiratory C0 2 concentration cycle whereas a basal C 4 cycle activity is present in type II 
C 3 -C 4 intermediates species. C 4 -like species exhibit much higher C 4 cycle activities but lack complete 
bundle sheath compartmentation of Rubisco activity (Edwards and Ku, 1987). 

Four independent experiments with plants grown during different seasons were performed to 
identify differences between the species that are dependent on their different modes of photosyn- 
thesis and independent of environmental influences. For each experiment the plants were seeded 
concurrently and grown side-by-side under greenhouse conditions. The second and fourth visible 
leaves from the top of all nine species were harvested at noon on the same day for transcript and 
protein analysis. Plants for experiment one were harvested in September 2009, for experiment two 
in June 2010, for experiment three in October 2010 and for experiment four in April 2011. The 
amounts of the core photorespiratory and C 4 enzymes were assessed by immunoblotting using spe- 
cific antibodies raised against synthetic peptides or recombinant proteins. The abundances of the 
corresponding RNAs as well of C 4 cycle associated transcripts were quantified by total transcriptome 
sequencing. 

The transcript profiles of the individual Flaveria species were 
comparable throughout all four experiments 

The transcriptomes of the different Flaveha species were sequenced by lllumina technology following 
standard procedures. In total, close to 200 Gb of raw sequence data were produced. After filtering of 
low quality reads 30 to 58 million reads per species and experiment were quantified (Figure 2 — source 
data 1). In a cross species approach, we mapped the sequences onto the minimal set of Arabidopsis 
thaliana coding sequences using the BLAST-like alignment tool BLAT (Kent, 2002) as described previ- 
ously (Gowik et aL, 2011) (Figure 2 — source data 2, data available from the Dryad Digital Repository: 
1 0.5061 /dryad. q827h). We were able to align approx. 50% of our reads to the Arabidopsis transcripts. 
This is lower as compared to a similar approach using 454 sequencing (Gowik et aL, 2011) and likely 
due to the shorter read length of the lllumina compared to the 454 reads. To overcome the low map- 
ping efficiency, the leaf transcriptomes of Fiaveha species were assembled de novo based on 454 
(Gowik et aL, 2011) and lllumina reads (this study). Among the contigs from F robusta, we identified 
full-length transcripts for all photorespiratory and C 4 genes in the focus of the present study and used 
these for further read mapping and detailed analysis. 
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To evaluate the variation between the four independent experiments, we performed hierarchical 
sample clustering and a principal component analysis of the transcript profiles derived from read map- 
ping on the minimal set of Arabidopsis coding sequences. Hierarchical sample clustering using Pearson 
correlation and average linkage clustering shows that the transcript profiles of all Flaveria species were 
quite similar in all four experiments since the samples cluster strictly species-wise (Figure 2A). The 
transcriptome patterns are influenced by the photosynthesis type and the phylogenetic relationships 
of the different species. The two C 4 species, both belonging to clade A, cluster together as do the two 
C 3 species that belong to the basal Flaveria species. Within the C 3 -C 4 intermediates the two more 
advanced intermediates F. ramosissima and F anomala cluster together, the only pattern which contra- 
dicts phylogenetic proximity since F ramosissima belongs to clade A and F anomala belongs to clade 
B. The last cluster consists out of the C 3 -C 4 intermediates F chloraefolia and F pubescens, and the 
C 4 -like species F brownii. 

Principle component analysis supports the results of the hierarchical clustering. The samples are 
mainly separated by photosynthesis type and phylogenetic relationships with the two intermediate 
species from different phylogenetic trajectories again forming a tight cluster {Figure 2B). The first 
three components, shown in Figure 2B, explain only 27% of the total variance. This is in good accord- 
ance with earlier results where it was shown that about 16% of all analyzed genes showed photosyn- 
thesis type related expression changes when the transcriptomes of the C 4 species F trinervia and 
F bidentis, the C 3 species F robusta and F pringlei and the C 3 -C 4 intermediate species F ramosissima 
were compared (Gowik et aL, 2011). 

Amounts of photorespiratory transcripts and proteins indicate that the 
C 2 pathway was established early during C 4 evolution in Flaveria and is 
present also in the C 4 -like species R brownii 

Photorespiratory genes are expressed in all species and photorespiratory proteins are detected in all 
species. To visualize the differences in transcript and protein abundance heat maps were plotted 
{Figure 3). The transcription of all photorespiratory genes except the transport proteins DIT1 and DIT2 
and one isoform of GLDH was downregulated in the C 4 species F bidentis and F trinervia compared 
to the C 3 species F pringlei und F robusta {Figure 3A, Figure 3 — source data 1). Both dicarboxylate 
transporters play an important role in generating the transfer acids in the C 4 pathway of NADP-ME 
plants such as F trinervia and F bidentis {Renne et aL, 2003] Gowik et aL, 2011; Kinoshita et aL, 
2011). This may explain why their expression pattern is more similar to the C 4 genes than to the other 
photorespiratory genes. 

The amounts of photorespiratory transcripts did not decrease gradually from C 3 to C 4 but the 
expression levels in the C 3 -C 4 intermediate species F chloraefolia, F. pubescens, F anomala and 
F ramosissima were mostly equal or higher than in the C 3 species. An exception are the transcripts of 
one GLDP, one GLDH and one SHM isoform which are drastically down-regulated also in the C 3 -C 4 
intermediate species. It was shown earlier that the down-regulation of this GLDP isoform is tightly 
associated with the establishment of the C 2 pathway in Flaveria {Schulze et aL, 2013). The down- 
regulation of the GLDH and SHM isogenes might have similar reasons since both enzymes are also 
involved in glycine decarboxylation. Only the C 4 -like species F brownii is intermediate with respect to 
photorespiratory transcripts. 1 9 of 27 transcripts are reduced compared to the C 3 -C 4 intermediate and 
C 3 species but have higher levels than the true C 4 species F bidentis and F trinervia. Exceptions are 
the components of the glycine decarboxylase complex as the respective transcripts levels are equal to 
these in the C 3 and C 3 -C 4 intermediate species {Figure 3A). 

The expression patterns described above were not only found for the genes encoding the core 
enzymes of photorespiration but also for the genes responsible for recycling of ammonia set free 
during photorespiration, GS/GOGAT. Also the genes of recently discovered transporters associated 
with photorespiration, PLGG1 and BOU {Eisenhut et aL, 2013; Pick et aL, 2013), behave accordingly. 

To test whether transcript abundance reflects protein abundance, amounts of core photorespira- 
tory proteins in the leaves of all nine species were quantified by protein gel blots. To this end we 
generated antibodies against conserved peptides from Flaveria GLDP, GLDT, GLDL, SHM, HPR, 
PGLP and GLYK proteins. Total leaf proteins were extracted from plant material harvested together 
with the material used for RNA isolation and equal amounts of protein were separated via SDS gel- 
electrophoresis prior to blotting {Figure 3 — figure supplement 1). The changes of protein amounts 
essentially reflected the changes of the amounts of the corresponding transcripts {Figure 3B, 
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Figure 2. Variation of transcript profiles of the individual Flaveha species between the four experiments. (A) Hierarchical sample clustering of all 

expressed transcripts. The tree was calculated with the MEV program using the HCL module with Pearson correlation and the average linkage method. 

(B) Principal component analysis of transcript levels. The first three components explain 27% of the total variance. 

DOI: 10.7554/el_ife.02478.004 

The following source data are available for figure 2: 

Source data 1. Results of the lllumina sequencing and cross species read mapping. 
DOI: 10.7554/el_ife.02478.005 

Source data 2. Quantitative information for all reads mapped in a cross species approach onto the reference transcriptome from Arabidopsis thaliana. 
DOI: 10.7554/el_ife.02478.006 



Figure 3 — figure supplement 2, Figure 3 — source data 2). The amounts of core photorespiratory 
proteins in the C 3 -C 4 intermediates were equal to the amounts in the C 3 species. A clear reduction of 
these proteins can be observed only for the true C 4 species and the C 4 — like species. F brownii exhibits 
intermediate amounts of most photorespiratory proteins. This indicates that the regulation of pho- 
torespiratory genes mainly occurs on the transcriptional level and that our approach to analyze the 
photorespiratory activity by comparative transcriptomics is reasonable. 

While the overall patterns remain similar between all independent experiments, individual proteins 
and transcripts vary between the four experiments. This likely reflects adjustments of photorespiratory 
gene expression to the different light and temperature conditions in our green house in the different 
seasons of the year. 

We conclude that the four experiments support the establishment of a photorespiratory C 2 cycle 
early during C 4 evolution in Flaveria and that this C 2 cycle was maintained until Rubisco activity was 
constricted to the bundle sheath cells in the true C 4 Flaveria species. 

An integrated model of C 2 photosynthesis 

While the principal physiological differences between C 3 and C 4 leaves are widely understood, knowledge 
about the metabolic reconfiguration required to implement a functional C 2 pathway into a C 3 leaf is 
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Figure 3. Abundance of photorespiratory transcripts and proteins in leaves of individual Flaveria species. Normalized transcript (A) and protein 

(B) amounts are plotted as heat maps. Transcript amounts were determined by lllumina sequencing of the leaf transcriptomes and read mapping on 

selected F. robusta full length transcript sequences. Protein amounts were determined by protein gel blots. See Figure 3 — source data 1 for absolute 

transcript levels, Figure 3 — source data 2 for protein quantification and Figure 3 — figure supplements 1 and 2 for immunoblots. Fp: F. pringlei (C 3 ); 

Fro: F. robusta (C 3 ); Fc: F. chloraefolia (C 3 -C 4 ); Fpu: F. pubescens (C 3 -C 4 ); Fa: F. anomala (C 3 -C 4 ); Fra: F. ramosissima (C 3 -C 4 ); Fbr: F. brownii (C 4 -like); 

Fb: F. bidentis (C 4 ); Ft: F. trinervia (C 4 ). 

DOI: 10.7554/el_ife.02478.007 

The following source data and figure supplements are available for figure 3: 

Source data 1. Transcript abundance of photorespiratory genes determined by read mapping on F. robusta full length transcript sequences. 
DOI: 10.7554/el_ife.02478.008 

Source data 2. Quantification of photorespiratory proteins by protein gel blot. 
DOI: 10.7554/el_ife.02478.009 

Figure supplement 1. Results of the protein analyses. 
DOI: 10.7554/el_ife.02478.010 

Figure supplement 2. Results of the protein analyses. 
DOI: 10.7554/el_ife.02478.011 



incomplete. In particular, moving glycine from mesophyll to bundle sheath cells (Hylton et al., 1988; 
Morgan et al., 1993) does not only translocate carbon, it also transports one nitrogen atom per two 
carbon atoms. Evidently, implementing the C 2 carbon pump requires balancing of metabolic routes to 
maintain homeostasis of both carbon and nitrogen metabolism (Monson and Rawsthorne, 2000). How 
this can be achieved is non-intuitive and it thus requires a systematic analysis by metabolic modeling. 
To this end, we simulated the leaf metabolism of a C 2 plant using an integrated model. We coupled a 
mechanistic model of C 3 -C 4 intermediate photosynthesis (von Caemmerer, 2000; Heckmann eta/., 2013) 
with a modified genome-scale stoichiometric model of C 4 photosynthesis that was designed to describe 
the entire metabolic interactions of mesophyll and bundle sheath cells in C 4 leaves (Dal'Molin eta/., 2010). 
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We used the mechanistic model to predict constraints for the stoichiometric model. It provided 
values for net C0 2 uptake, Rubisco carboxylation as well as oxygenation in mesophyll and bundle 
sheath, C0 2 leakage from the bundle sheath, PEPC activity in the mesophyll, activity of NADP-ME in 
the bundle sheath, plasmodesmatal flux of glycine and serine, and decarboxylation by the GDC. Given 
specific activities of the C 2 and C 4 cycles in the mechanistic model, we used flux balance analysis (FBA) 
to predict detailed flux distributions that follow biologically realistic optimality criteria (Varma and 
Palsson, 1994). We employed a maximization of leaf biomass production, followed by a minimization 
of the sum of absolute fluxes including transport processes. In the minimization of total flux, we 
allocated higher weights to plasmodesmatal fluxes in order to account for the trade-off between C0 2 
leakage and diffusion of metabolites between the cells. This framework allows us to investigate the 
most parsimonious implementation of C 2 and C 4 cycles, given a hypothesis about which metabolites 
are suitable for plasmodesmatal transport. 

The first outcome of simulating the photorespiratory C0 2 pump was that the establishment of the 
C 2 pathway has indeed a direct impact on the nitrogen metabolism of the leaf. It transports two mol- 
ecules of glycine from the mesophyll to the bundle sheath, where one molecule each of serine, C0 2 , 
and ammonium are produced. C0 2 is fixed by bundle sheath Rubisco and serine is transferred back to 
the mesophyll, where it is used for the regeneration of phosphoglycerate and photorespiratory gly- 
cine. This results in a net transport of C0 2 but also ammonia from the mesophyll to the bundle sheath. 
To create a noticeable C0 2 enrichment in the bundle sheath, the C 2 cycle must run with an appreciable 
capacity; indeed, the mechanistic model of C 3 -C 4 intermediate photosynthesis predicted an oxygena- 
tion rate of Rubisco of about one third of its carboxylation rate. Running at such rates, the C 2 cycle will 
create a massive nitrogen imbalance between mesophyll and bundle sheath cells, as was also pre- 
dicted earlier by Monson and Rawsthorne (2000). Within the stoichiometric model, the free diffusion 
of ammonia between the two cell types was not allowed, since ammonia is toxic and known to effec- 
tively uncouple electrochemical gradients {Krogmann eta/., 1959). Thus, ammonia must be refixed in 
the bundle sheath cells and transferred back to the mesophyll in the form of amino acids. According 
to the intergrated model, ammonia is fixed by glutamine synthetase and glutamine oxoglutarate ami- 
notransferase (GS/GOGAT) in the bundle sheath cells {Figure 4). Consistent with this prediction, we 
found that GS/GOGAT transcripts were upregulated in the C 3 -C 4 intermediate species {Figure 3). 

Estimating whether a certain metabolite is suitable for maintaining a diffusional gradient between 
mesophyll and bundle sheath is an unsolved problem. The impact on regulatory mechanisms and home- 
ostasis of the C 3 leaf may render some metabolites unsuitable to serve as transport metabolites. We 
address this problem by modeling multiple scenarios that assume different transport metabolites. 

If major amino acids and the corresponding oxoacids and dicarbonic acids are allowed to freely 
diffuse between cells in an integrated model representing a C 2 cycle, glutamate is predicted to be 
transferred to the mesophyll, where it is deaminated by GGT, regenerating the photorespiratory gly- 
cine. The resulting 2-oxoglutarate is transferred back to the bundle sheath cells {Figure 4A). The 
model preference for glutamate/2-oxoglutarate reflects the minimization of total flux in the FBA 
model, as this effectively minimizes the number of active enzymatic reactions and holds the plas- 
modesmatal flux for ammonia balance at one acceptor and one transport metabolite. 

To elucidate if alternative solutions exist that contain more steps but retain the same biomass out- 
put, the 2-oxoglutarate transfer between mesophyll and bundle sheath was constrained to prevent the 
glutamate/2-oxoglutarate exchange. The integrated model then predicts an alanine/pyruvate shuttle 
{Figure 4B). The glutamate produced by GS/GOGAT activity in the bundle sheath cells is used by ala- 
nine aminotransferase (Ala-AT) to aminate pyruvate. The resulting alanine is transferred to the meso- 
phyll and trans-aminated by Ala-AT resulting in pyruvate and glutamate. The glutamate is used to 
regenerate photorespiratory glycine and pyruvate is transferred back to the bundle sheath. 

If alanine and pyruvate transfer are also constrained, the model predicts an aspartate/malate shuttle 
{Figure 4C). This includes the oxidation of malate in the bundle sheath. The resulting oxaloacetate 
(OAA) is aminated by aspartate aminotransferase (Asp-AT) and aspartate moves to the mesophyll. 
Here aspartate is trans-aminated by Asp-AT and malate is regenerated by reduction of the resulting 
OAA and transferred to the bundle sheath. 

In all these scenarios, further increasing the weights on plasmodesmatal flux leads to transporter 
metabolites with increased N carrying capacity such as asparagine {Figure 4 — source data 1). 

In a restrictive scenario, all nitrogen containing compounds were excluded from plasmodesmatal 
transport, except for glycine and serine, which are used by the C 2 cycle itself. In this case, the model 
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Figure 4. Flux Balance Analysis of the C 2 photosynthetic pathway. Predicted fluxes if (A) major amino acids and the corresponding oxoacids and 
dicarbonic acids are allowed to freely diffuse between cells, (B) the a-ketoglutarate and glutamate transfer between mesophyll and bundle sheath was 
constrained (C) additionally the transfer of alanine and pyruvate between mesophyll and bundle sheath was constrained (D) transfer of all nitrogen 
containing compounds except for glycine and serine, which are used by the C 2 cycle were constrained. Fluxes are given in umol s" 1 irr 2 . Values in 
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(A): 17.8 umol s _1 rrr 2 ; (B): 18.4 umol s" 1 irr 2 ; (C): 19.0 umol s" 1 irr 2 ; (D): 22.1 umol s _1 rrr 2 . See Figure 4 — source data 1 for plasmodesmatal fluxes. 
DOI: 10.7554/el_ife.02478.012 
The following source data are available for figure 4: 

Source data 1 . Fluxes over plasmodesmata depending on the weight on plasmodesmatal fluxes including flux variability analysis. 
DOI: 10.7554/el_ife.02478.013 



predicts that bundle sheath derived ammonia is transferred from glutamate to phosphohydroxy- 
pyruvate by phosphoserine aminotransferase to yield phosphoserine; phosphoserine is then con- 
verted to serine by phosphoserine phosphatase. Finally, the serine moves to the mesophyll. This 
variant includes the transfer of 3-phosphoglycerate from the mesophyll to the bundle sheath, where it 
is converted to phosphohydroxy pyruvate by 3-phosphoglycerate dehydrogenase {Figure 4D). 

The model predicts a mechanistic interaction between C 2 and C 4 cycle 

In C 3 plants, basal activities of the typical C 4 cycle enzymes are present {Aubry et al., 2011). When our 
integrated model is parameterized to include an active C 4 cycle, it predicts that a contingent of the 
bundle sheath ammonia will be transferred to the mesophyll cells by the C 4 cycle as a biomass neutral 
alternative to the 2-OG/Glu shuttle or as the unique solution when additional weight on plasmodes- 
matal fluxes is applied {Figure 5 — source data 1). In this solution malate is decarboxylated in the 
bundle sheath cells. C0 2 is refixed by Rubisco, and the resulting pyruvate is aminated by Ala-AT. 
Alanine moves to the mesophyll cells, where ammonia is fed into the photorespiratory cycle by Ala-AT 
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and GGT. The resulting pyruvate is converted back to malate by PPDK, PEPC, and NADPH-dependent 
MDH {Figure 5A). Flux variability analysis shows that only marginal variability in the fluxes of the 
shuttle is possible (Figure 5 — source data 1). According to our model predictions, the cycle is active 
even at low PEPC activities, such as those measured in C 3 Flaveria species (Gowik et al, 2011; 
Heckmann et al., 2013). When the C 4 cycle runs with low capacity, according the model, the surplus 
of bundle sheath ammonia is transferred back to the mesophyll by the glutamate/2-oxoglutarate 
shuttle. Once the capacity of the C 4 cycle gradually increases, the recirculation of nitrogen is shifted 
from the glutamate/2-oxoglutarate shuttle towards the C 4 cycle {Figure 5B). The predicted biomass 
production increases linearly with C 4 cycle activity {Figure 5C). Thus, our model predicts a strong 
interaction between C 2 and C 4 photosynthesis. 

Analysis of C 4 cycle gene expression in C 3 -C 4 intermediate Flaveria 
species 

When the C 2 cycle is running with high capacity, our integrated modeling approach predicts the neces- 
sity of auxiliary metabolite fluxes between mesophyll and bundle sheath cells to prevent a massive 
nitrogen imbalance. Among those auxiliary fluxes were the pyruvate/alanine and the malate/aspartate 
exchanges. The metabolites used in these shuttles also serve as transport metabolites in C 4 photosyn- 
thesis. Furthermore, the model highlights the possibility that a low capacity C 4 cycle balances part of 
the C 2 cycle ammonia production. Therefore we analyzed in detail the expression of C 4 cycle related 
genes in our dataset. True C 4 Flaverias, such as F bidentis or F trinervia, are believed to use a NADP-ME 
type C 4 cycle (Moore et al, 1984, Ku et al, 1991, Meister et al, 1996, Gowik et al, 2011). All 
genes associated with this type of C 4 photosynthesis are gradually upregulated in the analyzed C 3 -C 4 
intermediate species in line with their degree of 'C 4 -ness'. This is true for the typical C 4 enzymes like 
PEPC, PPDK, MDH, NADP-ME, Ala-AT and a plastidic aspartate aminotransferase (Asp-AT), as well as 
for several C 4 associated transporters, such as the pyruvate transporter BASS2, the HVNa + exchanger 
NHD, the PEP translocator CUE1 and the putative malate and aspartate transporters DIT1 and DIT2 
{Weber and von Caemmerer, 2010', Brautigam et al, 2011; Furumoto et al, 2011; Gowik et al, 
2011). The regulators of the C 4 enzymes (like PEPC kinase or the PPDK regulatory protein) and 
enzymes with auxiliary functions of C 4 enzymes (like pyrophosphatases or adenosinmonophosphatases) 
show a similar pattern {Figure 6, Figure 6 — source data 1). To corroborate the results of the transcript 
abundance measurements, selected C 4 cycle enzymes (PEPC, PPDK, and NADP-ME) were measured 
by immunoblotting. The protein abundance correlates well with the transcript abundance {Figure 6, 
Figure 6 — source data 2). 

The expression changes of C 4 cycle genes do not all follow the same quantitative pattern {Figure 6Q. 
Although all of these genes gradually increase in expression when plants gain C 4 properties, as judged, 
for example, by the percentage of 14 C0 2 directly fixed into C 4 acids {Vogan and Sage, 2011), the 
quantitative changes in gene expression are quite different. PEPC and PPDK transcript amounts 
increase slowly in the C 3 -C 4 intermediates F chloraefolia, F pubescens, and F anomala, more steeply 
in the advanced C 3 -C 4 intermediate F ramosissima and the C 4 -like species F brownii before reaching 
the highest transcript abundances in the true C 4 species {Figure 6Q. In contrast, NADP-ME and Ala-AT 
gene expression already increase in expression in the more C 3 -like intermediate species. Their expres- 
sion rises more linearly in the further advanced intermediates and plateaus in the C 4 -like and C 4 spe- 
cies. If one uses the different Flaveria species as evolutionary proxies as suggested by the results of 
Heckmann et al (2013), these results suggest that NADP-ME and Ala-AT are strongly upregulated 
earlier in evolution than other C 4 core enzymes like PEPC or PPDK. 

F chloraefolia is classified as a type I C 3 -C 4 intermediate species, and no enhanced C 4 cycle activity 
should be present in this species based on the classification. We detected upregulation of all NADP-ME 
type associated C 4 genes, with some of the genes showing comparable small increases in expression 
{Figure 6). This is in line with the results of 14 C0 2 uptake studies that indicate about 14% of C0 2 is 
directly incorporated into C 4 acids in F chloraefolia, whereas only 6% goes into C 4 acids directly in the 
C 3 species F pringlei {Moore et al, 1987). We think therefore that a basal C 4 cycle activity is present 
in F chloraefolia and its classification as type I C 3 -C 4 intermediate is questionable. 

A gene encoding a mitochondrial NAD dependent malate dehydrogenase as well as several cyto- 
solic and especially one mitochondrial Asp-AT were upregulated exclusively in the C 3 -C 4 intermediate 
species and the C 4 -like F brownii {Figure 6). Often, high activities of these genes are associated 
with the NAD-ME or PEP-CK type of C 4 photosynthesis. NAD dependent malic enzyme and PEP 
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Figure 5. Continued 

bundle sheath to mesophyll cells (gray dashed line) at low C 4 cycle activities. (C) Changes in biomass production with varying (low) activity of the C 4 cycle 
in a C 2 plant. 

DOI: 10.7554/el_ife.02478.014 

The following source data are available for figure 5: 

Source data 1 . Fluxes over plasmodesmata depending on the weight on plasmodesmatal fluxes including flux variability analysis. 
DOI: 10.7554/el_ife.02478.015 



carboxykinase genes were only very lowly expressed in all analyzed Flaverias, and no obvious differ- 
ences between the C 3 -C 4 intermediates and the other species could be found (Figure 2 — source 
data 2, data available from the Dryad Digital Repository: http://dx.doi.org/10.5061/dryad.q827h). 

We found no transcriptomic evidence that ammonia is recirculated by the phosphoserine pathway 
predicted by the model that restricts the free diffusion of all amino acids except serine and glycine. 
The amounts of transcripts for all three enzymes of this pathway, i.e., phosphoserine aminotransferase, 
phosphoserine phosphatase, and 3-phosphoglycerate dehydrogenase, were found to be very low in 
all analyzed Flaveha species {Figure 2 — source data 2, data available from the Dryad Digital Repository: 
http://dx.doi.org/10.5061/dryad.q827h) {Mallmann et aL, 2014). 

Taken together, these data imply that the anaplerotic ammonia shuttle, required to maintain the 
nitrogen homeostasis in mesophyll and bundles sheath cells of plants performing C 2 photosynthe- 
sis, is active in all analyzed C 3 -C 4 Flaveha species, as predicted by the computer simulations. 
Furthermore, it appears that even the most C 3 -like C 3 -C 4 intermediate species analyzed within the 
present study, F chloraefolia, exhibits low level C 4 cycle activity. This activity is again in accordance 
with the in silico model, which predicts the C 4 cycle to be a highly efficient ammonia recirculation 
pathway. 

Discussion 

Photorespiration is mainly seen as a wasteful process, which arises from a malfunction of Rubisco and 
reduces photosynthetic efficiency {Ogren, 1984). In a high C0 2 atmosphere, Rubisco can operate 
efficiently. But the current atmospheric C0 2 concentration, combined with heat and drought, leads to 
an enhanced oxygenase activity and thereby the photosynthetic efficiency decreases {Raines, 2011). 
Up to 30% of the initially fixed C0 2 may be lost by photorespiration {Bauwe et aL, 2010). C 4 plants 
avoid this problem by enriching C0 2 at the site of Rubisco. C0 2 is prefixed in the mesophyll and 
released in the bundle sheath cells, where Rubisco is operating {Hatch, 1987). The establishment of 
the photorespiratory C0 2 pump, which relocates the release of photorespiratory C0 2 to the bundle 
sheath cells, appears to be an important intermediate step towards the C 4 cycle and our detailed study 
of Flaveria intermediate species suggests that genes associated with C 4 photosynthesis also played a 
role in the C 2 cycle. 

Implementation of the C 2 pathway leads to high expression of 
photorespiratory genes in C 3 -C 4 intermediate Flaveria species 

The expression of photorespiratory genes, including all genes encoding the core enzymes of the 
pathway, most of the transporters, and the enzymes involved in ammonia refixation, is not downregu- 
lated in the analyzed intermediate species; the transcript and protein amounts remain constant or in 
some cases are even higher compared to C 3 species. A significant drop in photorespiratory gene 
expression is only observed in the C 4 -like species F brownii and is decreased further in the C 4 species. 
Together with earlier results {Schulze et aL, 2013), this indicates that indeed a C 2 photosynthetic cycle 
is active in all these C 3 -C 4 intermediate Flaveha species and that a reduction in photorespiratory tran- 
scripts and proteins only occurs once the amounts of Rubisco have been reduced in the mesophyll as 
was described for the C 4 -like species F brownii {Bauwe, 1984; Holaday et aL, 1988). Rubisco reduc- 
tion in the mesophyll is thus a late step of C 4 evolution, which in the Flaveria series appears to not 
occur gradually but rather abruptly towards the end of the evolutionary trajectory. It is followed by a 
strong increase of C 4 cycle activity, as can be deduced from the upregulation of PEPC and PPDK genes 
in the real C 4 species {Figure 6C), when the primary C0 2 fixation is completely taken over by PEPC. 
In the intermediate species C 2 and C 4 cycles operate in parallel leading to similar or higher photores- 
piratory gene expression compared with the C 3 species. 



Mallmann et a/. eLife 2014;3:e02478. DOI: 10.7554/el_ife.02478 



12 of 23 



eLIFE 



Research article 



experiment 1 



experiment 2 



experiment 3 



Genomics and evolutionary biology | Plant biology 



experiment 4 



q. i: o a. co ir n n lo. l u a ro l ^ 



PEPC 
PPDK 
NADP-ME 
cp-NADP-MDH 
AlaAT 
AspAT5 
BASS2 
NHD 
CUE1 
PPT2 
TPT 
D»1 
Dit2 
AMK2 
AMK1 
PPA2 
PPA6 
PEPC-K 
PPDK-RP 
m-NAD-MDH 
AspATI 
AspAT2 
AspAT3 
AspAT4 
NAD_ME1 
NAD_ME2 
NAD_ME3 
cp-NAD-MDH 
PEP-CK1 
PEP-CK2 




0 0 relative abundance 100 0 

I I C 3 species | species containing C 2 cycle 



| C 4 species 



Figure 6. Abundance of C 4 related transcripts and proteins in leaves of individual Flaveria species. Normalized transcript (A) and protein (B) levels are 
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Figure 6. Continued on next page 
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Figure 6. Continued 

Figure 6 — source data 2 for protein quantification and Figure 3 — figure supplement 1 for immunoblots. (C) Mean values of transcript levels from all 
four experiments were clustered by hierarchical using the HCL module of MEV program with Pearson correlation and the average linkage method. 
The relative transcript abundance for PEPC, PPDK, NADP-ME and Ala-AT (mean values from all four experiments) are plotted for all nine species. 
Fp: F. pringlei (C 3 ); Fro: F. robusta (C 3 ); Fc: F. chloraefoiia (C 3 -C 4 ); Fpu: F. pubescens (C 3 -C 4 ); Fa: F. anomala (C 3 -C 4 ); Fra: F. ramosissima (C 3 -C 4 ); 
Fbr: F. brownii (C 4 — like); Fb: F. bidentis (C 4 ); Ft: F. trinervia (C 4 ). 
DOI: 10.7554/el_ife.02478.016 

The following source data and figure supplements are available for figure 6: 

Source data 1 . Transcript abundance of C 4 cycle genes determined by read mapping on F. robusta full length transcript sequences. 
DOI: 10.7554/el_ife.02478.017 

Source data 2. Quantification of C 4 proteins by protein gel blots. 
DOI: 10.7554/el_ife.02478.018 

Figure supplement 1. Results of the protein analyses. 
DOI: 10.7554/el_ife.02478.019 



Analysis of C 4 cycle gene expression supports the predictions of the 
C 2 model for C 3 -C 4 intermediate Flaveria species and implies the early 
establishment of a complete C 4 pathway 

The model of the C 2 cycle and the underlying metabolism proposes GS/GOGAT, Ala-AT, and Asp-AT 
to be involved in balancing the amino groups during C 2 cycle operation (Figure 4). The transcriptome 
data from the C 3 -C 4 intermediate Flaveria species largely support the results of our integrated model 
for the C 2 pathway {Figure 6). In these species we found an upregulation of genes involved in the three 
most likely mechanisms for the recovery of ammonia predicted by the model. GS/GOGAT, which cata- 
lyzes the primary refixation of ammonia in the bundle sheath cells, is important for all three versions of 
ammonia shuttles (Figure 4) and is upregulated in the intermediate species. Transcripts for the 
glutamate/2-oxoglutarate shuttle, the alanine/pyruvate shuttle, and the aspartate/malate shuttle are 
enriched in all C 3 -C 4 intermediates compared to the C 3 and C 4 Flavehas. For the alanine/pyruvate 
shuttle, Ala-AT is needed in the bundle sheath and the mesophyll cells. Ala-AT is upregulated already 
in the least advanced C 3 -C 4 intermediates F. chloraefoiia and F. pubescens, but also in all the other 
C 3 -C 4 intermediates. Ala-AT transcripts are also highly abundant in the true C 4 Flavehas since Ala-AT 
is directly involved in the C 4 cycle when alanine is used as transport metabolite. 

We found several Asp-AT and two MDH genes upregulated in the C 3 -C 4 intermediate species 
(Figure 6). The chloroplast-located MDH and Asp-AT genes are involved in the C 4 cycle of C 4 Flavehas, 
in which malate and aspartate are used concurrently as C 4 transport metabolites (Meister et a/., 
1996). Two further Asp-AT genes and another MDH gene were found to be upregulated exclusively in 
the C 3 -C 4 intermediates, including the C 4 -like species F. brownii. The most likely reason for upregula- 
tion of these genes is their involvement in the recirculation of photorespiratory ammonia by a malate/ 
aspartate shuttle. 

The pathways of ammonia recirculation between mesophyll and bundle sheath foreshadow the 
establishment of a true C 4 cycle (Figure 4). All variants described above need the establishment of 
inter- and intra-cellular transport capacities for amino acids and small organic acids, which are also 
needed for a functional C 4 cycle (Weber and von Caemmerer, 2010). The existence of an aspartate/ 
malate and an alanine/pyruvate shuttle anticipates important components of a functional C 4 pathway. 
Our transcript data imply that both of these shuttles are active in C 3 -C 4 intermediate Flavehas. Only a 
few additions would be required to convert these pathways of ammonia recirculation into a C 4 -like C0 2 
concentration mechanism, that is, malate would have to be decarboxylated in the bundle sheath cells 
and pyruvate would have to be converted to malate in the mesophyll. Our transcript data implies that 
this conversion of the photorespiratory ammonia recirculation pathway into a C 4 -like C0 2 concentrat- 
ing pump must have been an early event in C 4 evolution of Flaveria since already in the least advanced 
intermediates such as F. chloraefoiia and F. pubescens, NADP-ME transcripts are elevated and their 
amounts increase in parallel with Ala-AT and Asp-AT transcript levels. 

To extend the pathways of ammonia recirculation into a rudimentary C 4 cycle, a capacity to regen- 
erate malate from pyruvate in the mesophyll is required. As deduced from the transcriptome data, the 
enzymatic functions required are also already enhanced in the least advanced C 3 -C 4 intermediates, 
since we observe a low but consistent upregulation of PEPC and PPDK genes in these species 
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compared to the C 3 Flavehas. Measurements of radiolabeled C0 2 incorporation support the view 
that a rudimentary C 4 cycle is already operating in intermediate Flaveha species {Rumpho eta/., 1984; 
Monson et aL, 1986; Moore et aL, 1987; Ch a stain and Chollet, 1989). F chloraefolia as well as 
F pubescens incorporate a higher percentage of 14 C0 2 into the C 4 compounds malate and aspartate 
(11.3% and 24.9%) than the C 3 species F pringlei and F cronquistii (4.1% and 7.7%) {Vogan and Sage, 
2011). Thus even the least advanced intermediates analyzed in this study run already a low-level C 4 
cycle, which assists in recycling the ammonia liberated by GDC in the bundle sheath cells. 

The question arises whether amino group transfer initially exclusively happened via amino acid/ 
oxoacid pairs or whether the enzymatic content of C 3 plants immediately supported a shuttle that also 
involved decarboxylation and carboxylation reactions. C 3 plants have considerable capacity for the 
decarboxylation of four-carbon organic acids in their bundle sheath cell (Hibberd and Quick, 2002; 
Brown et aL, 2010) and measurements of total leaf NAD-ME and NADP-ME activity in C 3 plants re- 
peatedly demonstrated basal activities for various C 3 species (Wheeler et aL, 2005; Aubry et aL, 
2011; Maier et aL, 2011). C 3 plants also accumulate high amounts of organic C 4 acids like malate 
or fumarate during the day (Ze// et aL, 2010), which are produced by PEPC, the only enzyme capable 
of producing C 4 acids de novo. It is tempting to hypothesize that plants use a malate/alanine shuttle 
to recycle parts of the ammonia liberated by glycine decarboxylation from the very beginning of the 
C 2 cycle. 

Elevating the C 4 cycle activity in a C 2 plant enhances the C0 2 fixation 
capacity 

If the C 4 cycle is superimposed onto a C 2 cycle operating in a C 3 -C 4 intermediate plant, the C 2 photo- 
synthesis model predicts a mechanistic interaction between the C 2 and C 4 cycles (Figure 5). When the 
C 4 cycle is running, the photorespiratory ammonia is recirculated from the bundle sheath to the mes- 
ophyll cells by moving malate from the mesophyll to the bundle sheath and transferring alanine back 
to the mesophyll. This malate/alanine cycling leads to a net transport of ammonia from the bundle 
sheath into the mesophyll cells. In contrast to the other mechanisms of ammonia recirculation described 
above, the C 4 cycle does not only lead to a net transport of ammonia from the bundle sheath to the 
mesophyll but additionally also to a net transport of C0 2 in the opposite direction. Thus C0 2 is trans- 
ferred from the mesophyll to the bundle sheath without increasing the number of transport processes 
between the cells. By elevating the C0 2 concentration in the bundle sheath cells the C 4 cycle acts 
cooperatively with the C 2 cycle. The bundle sheath Rubisco would work under a more elevated C0 2 
concentration and thus operate more effectively compared to a pure C 2 plant, leading to an increased 
biomass production. The C 4 cycle thus has a dual beneficial effect: an efficient nitrogen shuttle is com- 
bined with a C0 2 concentrating pump. 

To investigate the possible interaction with regard to biomass, a C 4 cycle at the enzyme capacities 
of C 3 plants was allowed and tested for biomass changes (Figure 5 Q. When the C 4 cycle is running 
with PEPC activities comparable to those found in C 3 Flaveha species, the model already predicts a 
gain in biomass production compared to the C 2 cycle on its own. Under these conditions, the bulk 
of photorespiratory ammonia is recycled through a rudimentary C 4 cycle limited by the C 4 cycle flux 
capacity. The model predicts that biomass production will be further enhanced with higher activity of 
the C 4 cycle. Consequently, there is permanent positive selection on enhancing the activity of the cur- 
rently rate limiting enzyme once a C 4 cycle is running. 

The evolutionary scenario described above is in good agreement with the Flaveha transcriptome 
data. We observe gradual increases in the amounts of C 4 transcript with increasing 'C 4 -ness' of the 
C 3 -C 4 intermediates until the most advanced species F brownii. The abundance of NADP-ME and 
Ala-AT transcripts increases faster than the transcript abundance of the other core C 4 genes like PEPC, 
PPDK, MDH or Asp-AT. This implies that these evolutionary changes were driven by selection on 
high bundle sheath decarboxylation capacity, consistent with the idea that the C 4 cycle began as an 
auxiliary pathway to the C 2 cycle to recirculate photorespiratory ammonia. Hence, in this early phase, 
the main purpose of the C 4 cycle was to provide the ammonia acceptor pyruvate. The C 2 model and 
its evolutionary implications are consistent with the properties of the C 3 -C 4 intermediate Flaveha 
species including F brownii, which possess mesophyll Rubisco activity and consequently the C 2 photo- 
synthetic pathway. The next iteration during C 4 evolution in Flaveha must have been the restriction 
of Rubisco activity to the bundle sheath, making the C 2 cycle obsolete, as observed for the true 
C 4 Flaveha species. 
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A scenario for C 4 evolution in the genus Flaveria — a general blueprint 
for the evolution of C 4 photosynthesis? 

The establishment of a photorespiratory C0 2 pump, termed C 2 photosynthesis, is thought to be an 
important step in C 4 evolution. Recent work has shown how C 3 Flaverias were preconditioned for 
the evolution of the C 2 pathway and how the C 2 cycle was implemented on the molecular level {Sage 
et aL, 2013] Schulze et aL, 2013). Together with the present work, this gives us a detailed picture of 
what happened in the early and intermediate stages during C 4 evolution in Flaveria. 

We have argued that the establishment of the C 2 cycle requires the implementation of at least 
components of the C 4 pathway, if not the whole pathway. This fact might be a partial explanation for 
the polyphyletic evolution of C 4 photosynthesis. Only the C 2 cycle has to evolve to set a system on a 
slippery slope towards C 4 photosynthesis. Nature seems to confirm this idea. So far, 66 independent 
origins of C 4 photosynthesis could be identified. In contrast, there are only seven known groups 
with independent origins of C 2 plants and no direct ancestry to C 4 species (Sage et aL, 2012). If one 
assumes that all recent C 4 lineages evolved via C 2 intermediates, which appears likely (Sage et aL, 
2012] Heckmann et aL, 2013] Williams et aL, 2013), this would mean that the C 2 pathway evolved 
73 times independently and that over 90% of these C 2 plant containing lineages proceeded to 
C 4 photosynthesis. This indicates that the C 2 photosynthetic pathway must indeed be a strong 
enabler of C 4 photosynthesis. It will be highly enlightening to analyze these C 2 groups without 
ancestry to C 4 species, like Moricandia, Steinchisma or Mollugo, to find out in how far they differ 
from groups that evolved the C 4 pathway and why C 4 evolution may have been hampered in these 
groups. 

The close evolutionary interconnection of the C 2 and the C 4 pathway could be seen as an example 
of metabolic exaptation (Barve and Wagner, 2013). Exaptation or pre-adaptation was defined as an 
adaptation involving the co-option of traits that originally evolved for a different purpose (Gould and 
Vrba, 1982). While both C 2 and C 4 act as carbon shuttles to the bundle sheath cells, the two systems 
achieve this goal through different biochemical processes. In particular, the amino acid shuttle in the 
C 2 system evolved to transport nitrogen, and its later use in C 4 photosynthesis to shuttle carbon thus 
represents a molecular exaptation. Our findings therefore corroborate the general idea that the evo- 
lution of complex traits may be accelerated through exaptations (Darwin, 1872] Gould and Vrba, 
1982] Barve and Wagner, 2013). 

We do not know if the scenario on the early and intermediate stages of evolution described above 
is limited to the genus Flaveria or if it is valid for C 4 evolution in general. Our prediction of the C 2 path- 
way being a strong facilitator of C 4 evolution should apply to all C 4 origins, as the integrated model is 
not specific to Flaveria. 

Materials and methods 
Plant material 

F pringlei, F. robusta, F. chloraefolia, F pubescens, F anomala, F ramosissima, F brownii, F. bidentis 
and F. trinervia plants were grown in the green house at University of Duesseldorf side-by-side and 
harvested at four different points of time over the year. The plants were grown in 17-cm pots on soil 
(C-400 with Cocopor [Stender Erden, Schermbeck Germany] fertilized with 3 g/l Osmocote exact 
standard 3 to 4 M [Marysville, USA]) with additional light for 1 6 hr per day until 50 to 60 cm height and 
before the onset of flowering. 

Plants for experiment one were harvested in September, for experiment two in June, for experi- 
ment three in October and for experiment four in April. The plant material was immediately frozen in 
liquid nitrogen, stored at -80°C and used for the following analyses. 

RNA isolation, transcriptome sequencing and analysis 

Total RNA was isolated from the second and fourth leaves according to (Westhoff et aL, 1991) fol- 
lowed by a DNAse treatment. After phenol/chloroform extraction and precipitation with NaAc and 
isopropyl alcohol the RNA was dissolved in H 2 0. The RNA quality was tested with the Agilent 2100 
bioanalyzer. 1 ug of total RNA was used for cDNA library generation, which was accomplished with the 
TruSeq RNA Sample Preparation Kit (lllumina Inc., San Diego, USA) via the Low-Throughput Protocol 
(TruSeq RNA Sample Preparation Guide, lllumina Proprietary Catalog # RS-930-2001 , Part # 1 50081 36 
Rev. A, November 2010). Clusters were generated with the TruSeq SR Cluster Kit v2 according to the 
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Reagent Preparation Guide with the lllumina cBot device. The single read sequencing was performed 
with the lllumina HiSeq2000. 

Sequences of transcripts from genes involved in photorespiraton, C 4 photosynthesis and refixation 
and recirculation of photorespiratory ammonia were identified among de novo assembled transcripts 
of F. robusta. De novo assembly was performed with either CLC Genomics Workbench (CLC-Bio, 
Aaarhus, Denmark) or the Velvet/Oases software package {Schulz et aL, 2012) using F robusta 454 
{Gowik et aL, 2011) and lllumina reads (this study). 

After quality control and processing, lllumina reads were aligned to the F. robusta transcript 
sequences with the CLC Genomics Workbench using standard parameters. Read mapping against a 
minimal set of coding sequences (Brautigam et aL, 2011) of the TAIR 9 release of the Arabidopsis 
thaliana genome (http://www.Arabidopsis.org/) was performed using BLAT (Kent, 2002) as described 
in [Gowik et aL, 2011). 

The MEV software package (http://www.tm4.org/mev.html) was used for plotting heat maps, hier- 
archical clustering and principal component analysis. 

Protein isolation and quantification 

Total proteins were isolated from plant material harvested together with the material for RNA isolation 
according to Shen et aL (2007) and quantified using the RC-DC protocol (Bio-Rad Laboratories, 
Hercules, USA). 30 ug of total protein was electrophoresed on polyacrylamide-SDS gels (Schagger 
and von Jagow, 1987) and electrophoretically transferred to nitrocellulose membranes (Protran BA85, 
0.45 jiym; Schleicher & Schuell, Dassel, Germany) for 1 hr with 0.8 mA per cm 2 . Specific primary anti- 
bodies were raised against conserved Flaveria peptides (Agrisera Vannas, Sweden). For the detection 
of specific proteins the nitrocellulose membranes were incubated with the primary antibodies and a 
Horseradish peroxidase-conjugated secondary antibody (Sigma-Aldrich, St. Louis, USA). An enhanced 
chemiluminescent Horseradish peroxide substrate was added and signals were recorded using a Fuji 
LAS-4000 mini CCD camera system. The signals were quantified with the Multi Gage analysis software 
(Fujifilm, Tokyo, Japan). As loading control a gel was stained for 45 min with 0.25% Coomassie blue, 
50% methanol, 7% acetic acid, and destained in 50% methanol, 7% acetic acid. 

Coupling a mechanistic model with a genome-scale metabolic 
reconstruction 

In order to model the metabolic integration of C 2 and C 4 cycle in the context of leaf metabolism, we 
conducted Flux Balance Analysis (FBA) based on a genome-scale metabolic reconstruction of C 4 metab- 
olism, C4GEM (Dal'Molin et aL, 2010). This reconstruction contains a complex biomass reaction 
including carbohydrates, cell wall components, amino acids and nucleotides {Dal'Molin et aL, 2010). 

FBA is a powerful tool to understand the adaptation of metabolism on a genomic scale. Since 
metabolite concentrations are not modeled explicitly, fluxes related to carbon concentration mecha- 
nisms (CCMs) cannot be captured by this constraint-based approach alone. To account for this issue, 
we coupled the FBA model with a mechanistic model of C 3 -C 4 photosynthesis (von Caemmerer, 
2000] Heckmann et aL, 2013). 

C4GEM representing NADP-ME types was provided by the authors and FBA was conducted using 
this model: 

Maximize c T v 

subject to Sv = 0. 

VminJ ^ Vj < V max j 

where c is the vector of coefficients in the objective function, here the leaf biomass production, v is the 
vector of fluxes through the network reactions, S is the stoichiometric matrix of the metabolic network, 
and v min and v max represent constraints on the respective fluxes. 

In order to test hypotheses concerning nitrogen metabolism in C 3 -C 4 intermediate plants, S had 
to be modified. The plasmosdesmatal transport reactions in the original C4GEM model include malate, 
pyruvate, 3-phosphoglycerate, trioses, phosphates, sucrose, aspartate, alanine, phosphoenolpyru- 
vate, C0 2 , and 0 2 . Reactions were added to S in order to include transport of serine, glycine, gluta- 
mate, glutamine, asparagine, threonine, 2-oxoglutarate and water over the mesophyll/bundle sheath 
interface. Furthermore, the lack of photosystem II in the bundle sheath of certain C 4 plants does not 
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hold in our scenario (Nakamura et aL, 2013) and we added a reaction for linear electron transport to 
the bundle sheath. C4GEM does not contain a reaction for a plastidal NADP-dependent malate dehy- 
drogenase in the bundle sheath; we added this reaction to S. 

In addition to the stoichiometric matrix S, the constraints used in C4GEM were modified: 
The original constraint on leaf sucrose production was changed to result in an output ratio of 
sucrose to amino acids of about 5 (R/ens et aL, 1991). Fixed constraints on production of starch and 
fatty acids are not appropriate in the coupled framework. Since we are not aware of data that explains 
how these fluxes scale with net C0 2 assimilation rate, the constraints were removed from the model. 
Reactions belonging to the GS/GOGAT system were assumed to be irreversible. Nitrogen is available 
in the form of nitrate as opposed to NH 3 in the original model. Since there is no evidence suggest- 
ing mesophyll specificity of PEPC in intermediate Flaveria species, we unconstrained PEPC flux in the 
bundle sheath. 

To couple the genome-scale FBA model with the mechanistic model of carbon fixation, the fol- 
lowing reactions were constrained using the values predicted by the mechanistic model: net C0 2 
uptake, Rubisco carboxylation and oxygenation in mesophyll and bundle sheath, C0 2 leakage from 
the bundle sheath, PEPC activity in the mesophyll, activity of NADP-ME in the bundle sheath, plas- 
modesmatal flux of glycine and serine and decarboxylation by the GDC complex. The lower bound 
on glycine diffusion (V m]njG i y ), serine diffusion (V min s er ), and GDC reaction {V m]r)iGDC ) can be obtained from 
the rate of Rubisco oxygenation in the mesophyll (V om ) and the fraction of photorespiratory C0 2 in 
the bundle sheath derived from mesophyll oxygenations (£): 

v min,Gly ~ ^om. ^min,Ser ~ 0.5^V om , V m / n ,GDC = 0.5^V om 

The mechanistic model was parameterized to the C 3 state as given in Heckmann et a\. (2013), 
with the exception of the parameter which was set to a value of 0.98 {i.e., the majority of GDC 
activity was restricted to the bundle sheath. Derivation from transcriptome data is given in Heckmann 
et a\. (2013)). These constraints on the reactions of the photorespiratory pump are necessary to 
adequately predict C 2 photosynthesis because of the inability of FBA alone to model CCMs (see 
discussion above). 

In the FBA part of the model, a minimization of total flux (MTF) analysis was conducted in order to 
narrow down the space of optimal solutions: 

n 

Minimize ^ w,-|v;| 

;=1 

subject to: Sv = 0. 

Vm/'n,/' — V/ — Vmax,i 

c T v = c Vba 

where v FBA is the flux distribution of the FBA optimization described above, w denotes a vector of 
weights, where plasmodesmatal flux received a higher weighting factor (1.1 for plasmodesmatal 
exchange, 1 for the remaining reactions). This method implements a simple minimization of protein 
costs for a given optimal biomass production. The higher weights on plasmodesmatal fluxes account 
for the trade-off between C0 2 containment in the bundle sheath and metabolite diffusion between the 
cells. Since this trade-off is difficult to quantify, we conducted a sensitivity analysis by varying the 
weight on plasmodesmatal transport reactions. 

In order to investigate the possible range that fluxes can take while yielding an optimal solution, 
flux variability analysis was conducted: 

For each v,: 

Maximize or Minimize v,. 
Subject to: Sv = 0. 

Vm/'n,/' — V/ — Vmax,i 

c T v = cVba 
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n 

E W/|V/| = s opt 
/=1 

Where s opt is the minimum for the weighted sum of absolute flux found in the MTF optimization. 
All simulations were conducted in the R environment for statistical computing (R Core Team, 2013) 
using the sybil library {Gelius-Dietrich et aL, 2013). 

Accession numbers 

The read data have been submitted to the National Center for Biotechnology Information Short 
Read Archive under accession numbers SRP036880 (F bidentis), SRP036881 (F anomala), SRP036883 
(F brownii), SRP036884 {F. chloraefolia), SRP036885 {F. phnglei), SRP037526 {F pubescens), SRP037527 
{F ramosissima), SRP037528 (F robusta) and SRP037529 {F thnervia). 
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